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METHOD AND APPARATUS FOR COORDINATING TEXT 
AND AUDIO EVENTS IN A DIGITAL TALKING BOOK 

[0001] The present invention relates to an apparatus and concomitant 
method for coordinating the text and audio events in a digital talking book. 
Specifically, the present invention provides a method for synchronizing text 
elements of a book with specific previously recorded audio passages stored 
on an analog storage medium. In performing the synchronization function, 
the present invention also provides a flexible graphical user interface that 
allows a user to easily review and modify the synchronized elements. 

BACKGROUND OF THE DISCLOSURE 
[0002] As digital technologies continue to gain wide acceptance, a vast 
amount of previously stored information must be adapted into the new 
digital standards. Such previously stored information includes a vast 
library of existing analog-recorded books. To preserve the huge investment 
in such analog recordings, these recordings are being converted into digital 
format for implementation such as the Digital Talking Book ( DTB) in 
accordance with the "Daisy" consortium specifications. 

[0003] Unfortunately, among other requirements, the Daisy specification 
requires that each text element provides a point of synchronization (i.e., 
"synchronizable element") be associated with a specific recorded audio 
passage ("audio clip"). In a Daisy DTB recording system, this 
S3nichronization information can be captured at the time of recording. 
However, in a system designed to produce Daisy DTBs from existing 
recorded books on analog tape, this approach is very labor intensive and 
impractical. Specifically, in such a system, the text and audio components 
of the Digital Talking Book are produced independently of one another and 
must be married as a separate process. This process is very labor intensive 
and the enormity of the task is further amplified with the existence of 
hundreds of thousands of existing analog recorded books. 
[0004] Therefore, there is a need for an apparatus and method for 
coordinating independently-produced text and audio clip data and for 
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providing an efficient method of making the adjustments needed to produce 
a properly coordinated and constructed digital talking book. 

SUMMARY OF THE INVENTION 
[0005] An embodiment of the present invention is an apparatus and 
method for coordinating independently-produced text and audio clip data 
and for providing an efficient method of making the adjustments needed to 
produce a properly coordinated and constructed digital talking book. The 
present invention employs synchronization files, e.g., a book project 
management (BPM) file and a Time Stamp Data ( TSD) file in coordinating 
the text and audio clip data. 

[0006] In operation, various files, e.g., BPM, TSD, audio and HTML files 
are initially loaded. Next, sjnichronizable elements in the text are 
correlated with the audio clips. For example, if a book section is opened for 
the first time, the present invention will attempt to correlate the 
synchronizable elements in the HTML with the audio clips identified in the 
TSD. The audio clips in the TSD are identified as being either for heading 
(e.g. chapter) announcements or page announcements. 
[0007] Next, the present invention builds links between the HTML and 
TSD documents internally by first correlating all heading elements to 
heading TSD events, and then correlating the page elements that occur 
between those headings with the page events in the corresponding section of 
the TSD. This auto-linking feature is designed to serve as a "rough cut" of 
the text-audio coordination. 

[0008] Next, the present invention adds graphics to the HTML for 
display. Specifically, for each synchronizable element in the HTML, the 
present invention inserts images in its internal representation of the 
HTML. These images identify whether or not the element has been linked to 
a TSD event. Similarly, the present invention adds graphics to the internal 
representation of the audio event data from the TSD file. These images 
identify whether or not the audio event has been linked to a synchronizable 
element. This combination of images and information provides a user 
friendly graphical interface to be deployed, where the HTML (with the 
embedded graphics) is displayed on one side and a list of all TSD events 
(with the embedded graphics) are displayed on the other. This allows the 
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operator to quickly tell at a glance which text elements need to be linked to 
audio events. 

[0009] Finally, the present invention allows the accuracy of the links to 
be verified when the operator activates, e.g., clicks on a linked HTML 
element, thereby causing the associated audio clip to be played. This allows 
the operator to verify that the HTML element has been correctly linked. 
Similarly the operator can click on an event in the TSD list to hear the 
audio clip represented by that event while the associated HTML element (if 
any) is highlighted. If the accuracy of the links requires adjustment, the 
present invention allows various edit functions to be performed, e.g., 
breaking links, adding links, grouping links, adjusting timing of TSD event, 
creating/deleting TSD events editing HTML elements. 

BRIEF DESCRIPTION OF THE DRAWINGS 
[0010] The teachings of the present invention can be readily understood 
by considering the following detailed description in conjunction with the 
accompanying drawings, in which: 

[0011] FIG. 1 depicts a block diagram of the present invention for 

coordinating independently-produced text and audio clip data to produce a 

properly coordinated and constructed digital talking book; 

[0012] FIG. 2 depicts a block diagram of the data structure of a book 

project management ( BPM) file of the present invention; 

[0013] FIG. 3 depicts a block diagram of the data structure of a time 

stamp data (TSD) file of the present invention; 

[0014] FIG. 4 depicts a block diagram of the data structure of a track 
announcement data (TAD) file of the present invention; 
[0015] FIG. 5 is a screen shot of the graphical user interface of the 
present invention; and 

[001 6] FIG. 6 depicts a block diagram of a flowchart of the method of the 
present invention for coordinating independently-produced text and audio 
clip data to produce a properly coordinated and constructed digital talking 
book. 

[0017] To facilitate understanding, identical reference numerals have 
been used, where possible, to designate identical elements that are common 
to the figures. 



M&My002 



DETAILED DESCRIPTION 
[0018] The present invention provides an apparatus and method for 
coordinating independently-produced text and audio chp data to produce a 
properly coordinated and constructed digital talking book. Specifically, FIG. 
1 illustrates a block diagram of the present invention having a pre- 
processing unit 130 and a time offset adjustment controller ( TOAC) 140. 
[0019] In operation, a source of text data on path 110 and a source of 
audio data on path 120 are pre-processed by the pre-processing unit 130 into 
appropriate formats. For example, the text data may comprise one or more 
text documents stored in a word processor format, (e.g., Word or 
WordPerfect). Alternative, the text data may comprise pages of text that 
are to be converted into digital form via a scanner 134. In one embodiment, 
the pre-processing unit 130 converts the text data on path 110 into a 
preferred format, i.e., marked-up text file(s), on path 132 for processing by 
the time offset adjustment controller ( TOAC) 140. Specifically, the text data 
on path 132 can be presented in either HyperText Markup Language 
(HTML) or XML. HTML pages may include information structures known 
as "hypertext" or "hypertext links." Hypertext, within the context of the 
present invention, is typically a graphic or textual portion of a page which 
includes a parameter contextually related to an audio element. By 
accessing a hypertext hnk, an audio clip associated with that hypertext hnk 
is retrieved and played. 

[0020] The marked-up text files may contain the full text of the original 
printed book, or may consist of a subset of this text. For example, one 
typical type of book produced through analog-to-digital conversion contains 
only the major headings and page numbers in the text portion. This has 
been called a "Table of Contents" or "TOC" book. 

[0021] The audio data on path 120 is typically independently-produced 
audio clips that were previously recorded in an analog format. The pre- 
processing unit 130 can also convert this audio data on path 120 into a 
number of different formats , e.g., MP3, WAV and the like, on path 136 for 
processing by the time offset adjustment controller ( TOAC) 140. The audio 
files typically contain the full-recorded text of the printed book. 
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[0022] Finally, one or more synchronization data files are also generated 
by the pre-processing unit 130 on path 134. These synchronization data 
files are used by the time offset adjustment controller ( TOAC) 140 to 
S3nichronize and coordinate independently-produced text and audio clip data 
to produce a properly coordinated and constructed digital talking book. The 
data structures for these synchronization data files are illustrated in FIGs. 
2-4 and are described below. Examples on the methods for generating these 
synchronization data files are disclosed in US patent application entitled 
"Method And Apparatus For Converting An Analog Audio Source Into A 
Digital Format" with attorney docket "M&M/003", which is herein 
incorporated by reference and is filed simultaneous herewith. 
[0023] Thus, it should be noted that pre-processing unit 130 comprises a 
plurality of modules, e.g., an analog-to-digital (A/D) converter 132, a scanner 
134 and any other modules that may be necessary to generate the text files, 
audio files and synchronization data files on paths 132-136. In fact, the pre- 
processing unit 130 can be implemented using a general purpose computer 
(not shown) having a central processing unit, a memory and various I/O 
devices (e.g., similar to that of the TOAC 140 as described below). 
[0024] In one embodiment, the time offset adjustment controller ( TOAC) 
140 is implemented using a general purpose computer having a central 
processing unit (CPU) 142, a memory 144, and various Input/Output (I/O) 
devices 146. The input and output devices 146 may comprise a keyboard, a 
mouse, a modem, a camera, a camcorder, a video monitor, any number of 
imaging devices or storage devices, including but not limited to, a tape 
drive, a floppy drive, a hard disk drive or a compact disk drive. The general 
purpose computer allows a user to produce a properly coordinated and 
constructed digital talking book using the files received on paths 132-136. 
[0025] In the preferred embodiment, various functions of the time offset 
adjustment controller ( TOAC) 140 as discussed below are implemented (in 
part or in whole) by a software application that is loaded from a storage 
device and resides in the memory 144 of the computer. As such, the time 
offset adjustment controller ( TOAC) 140 and associated methods and/or 
data structures illustrated in FIGs. 2-4 of the present invention can be 
stored on a computer readable medium. Finally, it should be noted that the 
general purpose computer of the time offset adjustment controller 140 
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should be broadly interpreted to include one or more personal computers, 
servers, main frames and the like. 

[0026] FIG. 2 illustrates a block diagram of the data structure 200 of a 
first sjrnchronization file, i.e., a "book project management" ( BPM) file of the 
present invention. For the purposes of the TOAC, a "book-project" consists 
of all the files required to construct a single DTB. The data required to 
manage the entire book project is stored in the BPM file. The TOAC uses a 
software application, e.g., an XML application, for processing these data 
files. The document t5rpe definition for the BPM file is given in the 
Appendix. The BPM contains: (1) project metadata 210, (2) a list of the 
project text files 220, and (3) information on which text elements 230 are 
synchronizable and/or navigable. 

[0027] Project metadata comprises certain data about a DTB and its 
source printed book that are required as part of the Daisy DTB specification. 
The BPM contains a list of these metadata items stored in its < meta_entry> 
elements. Specifically, project metadata represents information about the 
talking book and the print version from which it was derived. This includes 
items such as title, author, original publisher, copyright date, language, 
ISBN number of a book , Daisy specification and etc. 
[0028] More specifically, metadata items are items that are used to 
provide additional information about the document in question, but that are 
not necessarily part of the content of the document. For example, the ISBN 
number of a book would be included as metadata of a document, but it is not 
itself part of the content of the document. These metadata items are 
designed to be used by software applications for providing advanced 
cataloging, bibliographic and archival information. The metadata items are 
primarily used for indexing documents, and for providing search 
functionality for information that is not direct content of the document. 
Specification such as DAISY may provide a list of metadata items that are 
required to produce a DAISY compliant talking book. 

[0029] The BPM file includes a list of the marked-up text files that make 
up the book. These are given in the < file_entry> elements, which are listed 
in the order that they are to be present in the DTB. Various attributes can 
be employed to define the name of the source text file, the path or location of 
the source text file and the type of the source text file (e.g., HTML or XML). 
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[0030] Finally, BPM contains information that identifies synchronizable 
or navigation elements. Specifically, the marked-up text by itself does not 
contain within it any specific indication of where sjmchronization is to occur 
with the audio data. However, the markup standards that are used, provide 
a means of identifjdng certain classes of elements (headings, pages, etc.). 
The BPM then contains identifications of which classes of elements are to be 
considered points of synchronization. These are listed in the < smil_sync> 
element, which is a list of < sync_entry> elements identifjang text markup 
types. Not all items that are sjmchronized with the text may be used as 
high-level navigation points. The < ncc_sync> element is a list of 
<sync_entry> elements that identify the text elements that will be included 
in the Navigation Control Center ( NCC) that is part of every Daisy DTB. 
The <ncc_sync> list is always a subset of the < smil_sync> list. 
[0031] FIG. 3 depicts a block diagram of the data structure 300 of a 
second synchronization file, i.e., a "time stamp data" ( TSD) file of the 
present invention. Within each audio recording, there are a number of 
points which are to be synchronized with specific elements in the marked-up 
text. The information about these time points is stored in separate data 
files. The TOAC uses a software application, e.g., an XML application, for 
processing these time stamp data ( TSD) files. The document type definition 
(DTD) for a TSD file is given in the Appendix. 

[0032] The TSD file 300 contains one or more <data> elements 310, 
where each data element contains the time data for a single audio file. 
Various attributes can be employed with the data element 310 for defining 
the name of the audio file and the amount of recorded time in the audio file. 
Each data element contains at least one record element 320 that identifies 
elements of audio that are associated with a given navigation point. 
[0033] Specifically, each audio clip is expressed as a <record> element 
320, containing a unique ID 322, the clip starting time 324, the clip ending 
time 326 and type 328. The ID attribute 322 holds a value of the associated 
navigation point in the source text file. The Starttime attribute indicates 
the time as to where the associated audio segment begins. The Endtime 
attribute indicates the time as to where the associated audio segment ends. 
The type attribute indicates whether the audio is encapsulated (i.e., stops 
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exactly at end point) or open-ended (i.e., continues until the start time of the 
next event exactly). 

[0034] Each <data> element can contain one or more <record> elements. 
The order of the <data> and <record> elements within the TSD file 
represents the order in which the clips are to be presented in the DTB. In 
one embodiment of the TOAC 140, each TSD file is associated with one and 
only one marked-up file. 

[0035] FIG. 4 depicts a block diagram of the data structure 400 of a third 
synchronization file, i.e., a track announcement data (TAD) file of the 
present invention. Specifically, additional audio timing data is stored in the 
TAD file. This file uses the same XML application as the TSD files to store 
information about the recorded analog track announcements. These 
announcements were required in the original analog product, but have no 
specific use in a DTB. However, storing the track announcements from the 
DTB, allows them to remain in the digital audio files so that they can be 
used for future digital-to-analog conversions. The TAD file stores timing 
information that describes the location of the track announcements in the 
original analog product. Each entry in the TAD file describes a single audio 
clip that encompasses one of these announcements. This information can be 
used when creating the DTB SMIL files to omit playback of these 
announcements in the digital product without actually deleting the 
announcements from the audio files themselves. Thus, the audio files 
remain an exact image of the original analog product tracks. This is 
desirable should one wish to use these files as digital masters for future 
analog cassette production. 

[0036] The TAD or Track Announcement Data file is a file that uses the 
exact same file and data structure as the Time Stamp Data file, which is 
described above and in the Appendix. The TAD file provides a location to 
store information about the announcements added during the analog 
recording process. The announcements do not include any of the content of 
the text, and are used solely for user reference information at the beginning 
and end of each track of recording. An example of a Track Announcement 
would be "Tape 2, Track 3, Pages 123 through 145." This information is 
necessary for user navigation in the analog format, and is not applicable to 
the digital format. By capturing this information during Analog-to-Digital 
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conversion, and separating it from the actual content of the book, future 
Digital-to-Analog conversion for mastering tapes from a digital archive is 
simplified. By separating this information from the content of the book, the 
present invention is able to leave it in the audio files, but avoid those 
portions of the audio during playback. 

[0037] FIG. 6 depicts a block diagram of a flowchart of the method 600 of 
the present invention for coordinating independently-produced text and 
audio clip data to produce a properly coordinated and constructed digital 
talking book. Method 600 starts in step 605 and proceeds to step 610. 
[0038] In step 610, method 600 loads various files, e.g., BPM, TSD, audio 
and HTML files. Namely, it is assumed that all the audio , text, and 
synchronization files, e.g., BPM, TSD and TAD files for the book project 
have been generated beforehand by some operations. The BPM file can be 
created by hand from within the TOAC. For example, the user simply fills 
in a form with the appropriate metadata, text file, and synchronizable 
element information. Alternatively, the BPM can be created beforehand by 
some other means, e.g., via the preprocessing unit 130 and simply opened by 
the TOAC operator. 

[0039] It should be noted that the TOAC can operate on one "book 
section" at a time. A book section may consist of a single marked-up text 
file, the TSD file associated with this text file, and all the audio files 
referenced by the TSD file. The TOAC identifies book sections by parsing 
the file list in the BPM. After loading the BPM, the operator may select a 
book section to work with and open it. 

[0040] Once the relevant files are opened, method 600 correlates 
synchronizable elements in the text with the audio clips in step 620. 
Namely, if this is the first time that this book section has been opened, the 
TOAC will attempt to correlate the synchronizable elements in the HTML 
with the audio clips identified in the TSD. The audio clips in the TSD are 
identified as being either for heading (e.g. chapter) announcements or page 
announcements. The HTML should include heading and page elements, 
identified as given in the Daisy DTB specification. 

[0041] In step 630, method 600 builds links between the HTML and TSD 
documents internally by first correlating all heading elements to heading 
TSD events, and then correlating the page elements that occur between 
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those headings with the page events in the corresponding section of the 
TSD. This auto-hnking feature is designed to serve as a "rough cut" of the 
text-audio coordination. 

[0042] More specifically, the TOAC represents the links between HTML 
elements and TSD events by assigning identical ID attributes to each. The 
use of identical IDs allows the TOAC to rebuild its internal table of links 
automatically when a book section is reopened for further editing. This use 
of identical IDs is also how the play back software is able to associate a 
specific text element with its associated audio. 

[0043] In step 640, method 600 adds graphics to the HTML for display. 
Specifically, for each synchronizable element in the HTML, the TOAC 
inserts an image in its internal representation of the HTML. This image 
identifies whether or not the element has been linked to a TSD event. 
Similarly, for each TSD event, the TOAC inserts an image in its internal 
representation of the data. This image identifies whether or not the audio 
event has been linked to a S3Tachronizable element. In one implementation, 
linked elements and TSD events are identified by a green checkmark and 
unlinked elements and TSD events are identified by a red "X". This allows 
the operator to quickly tell at a glance which text elements need to be linked 
to audio events. However, it should be noted that the present invention is 
not so limited and that other graphical schemes or S3Txibols can be adapted 
to the present invention. 

[0044] In step 650, method 600 displays the HTML and TSD. A screen 
display of this implementation is provided in FIG. 5, where the HTML (with 
the TOAC-embedded graphics) is displayed on one side and a list of all TSD 
events are displayed on the other. 

[0045] In step 660, method 600 queries whether the accuracy of the links 
are to be checked. If the query is negatively answered, then method 600 
ends in step 695. If the query is positively answered, then method 600 
proceeds to step 670. 

[0046] In step 670, accuracy of the links can be verified when the 
operator activates, e.g., clicks on a linked HTML element, causing the TOAC 
to begin playing the associated audio clip. This allows the operator to verify 
that the HTML element has been correctly linked. Similarly the operator 
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can click on an event in the TSD list to hear the audio clip represented by 
that event while the associated HTML element (if any) is highlighted. 
[0047] In step 680, method 600 queries whether an edit function is to be 
performed. If the query is negatively answered, then method 600 ends in 
step 695. If the query is positively answered, then method 600 proceeds to 
step 690 where various edit functions can be performed. 
[0048] Specifically, the operator can change the data in a book section in 
many ways, including: 

[0049] a. Breaking hnks: If an HTML element is incorrectly linked to a 
TSD event, the operator can break that link. The TOAC will change the 
display of the associated graphic accordingly. 

[0050] b. Adding links: The operator can link an unlinked HTML 
element to a TSD event. The TOAC will change the display of the 
associated graphic accordingly. 

[0051] c. Group linking/unlinking: If a continuous series of HTML 
elements are to be linked to a continuous series of TSD events, the operator 
can select these two groups and link them in a single operation. Similarly, 
if a series of HTML elements/ TSD events are improperly linked, they can be 
selected and unlinked in a single operation. 

[0052] d. Adjust timing of TSD events : If an audio clip does not start 
and/or end at the correct time to present the heading or page 
announcement, these times can be adjusted by the operator. This can be 
accomplished by an on-screen control dial, or by entering the timings 
directly. 

[0053] e. Create/delete TSD events : If the audio clip for a 
synchronizable element is not identified in the TSD, a new event can be 
created at any point within the TSD to record this information. Similarly, 
unnecessary TSD events can be deleted form the list. 

[0054] f Create/delete HTML elements : If the input HTML is missing 
one or more sjmchronizable items, the operation can insert these at any 
point in the HTML file. In the case of page elements, the TOAC allows the 
operator to enter a range of these in one operation. The HTML elements so 
added can then be linked to the appropriate TSD events. Similarly, HTML 
elements which are not needed can be deleted from the HTML file. 
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[0055] g. Edit HTML elements: If the text of an HTML element is 
incorrect (e.g., a page number is incorrect, or a heading contains a 
misspelling), the operator can edit the text of this element. 
[0056] After all editing functions are performed, method 600 returns to 
step 660, where accuracy can be performed again as described above. 
Otherwise, method 600 ends in step 695. 

[0057] It should be noted that after the operator has finished making all 
necessary adjustments to a specific book section, he or she can create a 
SMIL (Synchronized Multimedia Integration Language) file for this book 
section. The Daisy DTB specification requires the use of SMIL as the 
format for text-audio synchronization data. The TOAC includes the ability 
to generate SMIL files from the TSD files. 

[0058] After all the book sections are completed and all SMIL files have 
been created, the operator can create an NCC for the DTB. The NCC is 
required by the Daisy DTB specification. The TOAC includes the abihty to 
generate an NCC based on data found in the BPM and marked-up text files. 
The NCC, SMIL, marked-up text, and audio files together will now form a 
Daisy-compliant DTB. 

[0059] The DTDs for both the Book Project Management file and the 
Time Stamp Data file are provided in the Appendix below. Descriptions are 
also provided to assist the reader in understanding the DTDs. It should be 
noted that specific structures of the DTDs are implemented in one 
embodiment of the present invention. As such, those skilled in the art will 
reahze that specific structures of the DTDs can be adjusted in accordance to 
a particular implementation and should not be interpreted to limit the 
present invention. 

[0060] Although various embodiments which incorporate the teachings of 
the present invention have been shown and described in detail herein, those 
skilled in the art can readily devise many other varied embodiments that 
still incorporate these teachings. 
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Appendix 

Book Project Management DTD 

<?xml version="1.0" encoding="ISO-8859-l" ?> 

<!ENTITY It "&#60;"> 
<!ENTITY gt ">"> 
<!ENTITY amp "&#38;"> 
<!ENTITY apos "&#39;"> 
<!ENTITY quot """> 

<!ENTITY % text "# PCDATA"> 

<!ELEMENTbpm (meta_info, file_list, smil_sync, ncc_sync, comment?)> 

< [ELEMENT metajnfo ( meta_entry+)> 

<!ELEMENT meta_entry EMPTY> 
<!ATTLIST meta_entry 

name CDATA #REQUIRED 

content CDATA #REQUIRED 

scheme CDATA #IMPLIED 



<!ELEMENTfile_list ( file_entry+)> 

< [ELEMENT file_entry EMPTY> 

<!ATTLIST file_entry 
name CDATA #RE QUIRED 

path CDATA #REQUIRED 

type (XML | HTML) "HTML" 



<!ELEMENT sync_entry EMPTY> 

<!ATTLIST sync_entry 
elem_name CDATA #REQUIRED 

att_name CDATA #IMPLIED 

att_value CDATA #IMPLIED 



<!ELEMENT smil_sync ( sync_entry+)> 
<!ELEMENT ncc_sync ( sync_entry+)> 
<!ELEMENT comment (%text;)> 



- 13- 



M&M/002 



Book Project Management File Description 



Element: BPM 


This is the root element of the Book Project Management 
file. This element also defines the types of additional 
elements that can be found in the file, and in what order 
they appear. 


Element: 
Meta_Info 


This element begins the listing of meta data items in the 
book project management file. This is the first element 
following the BPM element, and contains at least one 
Meta_Entry Element, but may contain more. 


Element: 
Meta_Entry 


This element can be used as part of the Meta_Info 
section of the Book Project Management file. A single 
Meta_Entry element is used for each piece of Meta Data 
to be included in the book project. The attributes of the 
Meta_Entry are used to organize the individual parts of 
a Meta Data entry. 


Attribute Name 


Attribute Data Type 


Attribute Description 


Name 


String 


This attribute holds the name 
that identifies which meta data 
element is being described by 
this Meta_Entry. 


Content 


String 


This attribute holds the value of 
the meta data element that is 
described by the Meta_Entry. 


Scheme 


String 


For Meta_Entries where the 
content can be interpreted in a 
variety of ways, the optional 
Scheme attribute indicates 
which interpretation is required, 
(e.g. Name="Copyright Date", 
Content="02-02-2001", 
Scheme="MM-DD-YYYY" to 
indicate that the Date format is 
Month, followed by Day instead 
of Day followed by Month.) 
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Element: 
File_List 


This element indicates the beginning of the listing of 
"source text"* files included in the project. This element 
follows the Meta_Info section of the Book Project 
Management file, and contains at least one File_Entry 
element. 


Element: 
File_Entry 


This element can be used as part of the File_List section 
of the Book Project Management file. A single 
File_Entry element is used for each "source text" file 
associated with the Book Project. 


Attribute Name 


Attribute Data Type 1 Description 


Name 


String 


This attribute holds the 
name of the source text file. 


Path 


String 


This attribute is used to 
hold either a relative or 
absolute path to the 
location of the file on the 
storage media. If this is a 
relative path, the parent of 
this path is the path 
defining the location of the 
Book Project Management 
file on the storage media. 


Type 


"HTML" or " XML" 


This optional attribute is 
used to identify the type of 
source text file. If no Type 
attribute is specified, a type 
of HTML is assumed. 


Element: 

^j-i-i-iii iDyilL' 


This element indicates the beginning of the listing of 
synchronizable elements within the source text files that 
are used to indicate points of navigation. This element 
follows the File_List Element, and contains at least on 

Sync_Entry element. 

. 
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Element: 
NCC_Sync 


This element indicates the beginning of the listing of 
synchronizable elements of the source text files that are 
to be included in the Navigation Control Center ( NCC) 
file that is defined as part of the DAISY (Digital Audio 
Information System) Standard. This element follows the 
Smil_Sync element, and contains at least one 
Sync_Entry element. The Sync_Entry elements in this 
section can be a subset of the S3mc_Entry elements in 
the Smil_S37nc section. 


Element: 
Sync_Entry 


This element is used to store information about the 
markup elements in the source text files that indicate 
navigation points. The attributes of this element are 
used to organize the individual parts of the Sync_Entry. 


Attribute Name 


Attribute Data Type 


Description 


Elem_Name 


String 


This attribute holds the 
name of the markup element 
that identifies navigation (or 
synchronization) points in 
the source text file. ( e.g. In 
an HTML document the HI 
element can be specified as a 
navigation element.) 


Att_Name 


String 


This optional attribute works 
in conjunction with the 
Att_Value attribute to define 
a specific subset of the 
elements described by the 
Elem_Name attribute. ( e.g. 
In an HTML document, the 
CLASS attribute of the HI 
element can be used as an 
identifying attribute.) 



- 16- 



M&M/002 



Att_Value 


String 


This optional attribute is 
used in conjunction with the 
Att_Name attribute to 
specify a subset of the 
elements described by the 
Elem_Name attribute. If an 
Att_Name is specified, and 
Att_Value can be included 
with it. (e.g. In an HTML 
document, a value of 
CHAPTER for the CLASS 
attribute could define a 
subset of the HI element.) 


Example: <sync_entry elem_name="Hl" att_name="CLASS" 

att_value= "CHAPTER" /> describes the subset of HI elements in the source 

text file with a CLASS attribute whose value is equal to CHAPTER. 


Element: Comment 


This optional element is the last entry in the Book 
Project Management file. This element contains 
text for any comments about this project. 



Notes: * "Source Text" is used to indicate the file that contains the text (or 
portions of the text ) as they would appear within the original printed 
version of the text. This generic term is used to represent any type of file 
that can contain such data. e.g. HTML File, XML File, etc. 
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Time Stamp Data DTD 

<?xml version="1.0" encoding="ISO-8859-l" ?> 
<!ELEMENT tsd (data+)> 
<!ATTLIST tsd 
sourcetype (VRW|AD) #REQUIRED 



<!ELEMENT data (record+)> 

<!ATTLIST data 
audiofile CDATA #REQUIRED 
totaltime CDATA #REQUIRED 



< [ELEMENT record EMPTY > 

<!ATTLIST record 
id ID #REQUIRED 

starttime CDATA #REQUIRED 
endtime CDATA #REQUIRED 
type (e|o) #REQUIRED 

comment CDATA #IMPLIED 



- 18 - 



M&M/002 



Time Stamp Data/Track Announcement Data Files Description 



Element: TSD 


This IS the root element of the Time Stamp and Track 
Announcement data files. This element also defines 
the types of additional elements that can be found in 
the file, and in what order they must appear. 


Attribute Name 


Attribute Data Type 


Description 


Sourcet5;Tje 


"VRW" or "AD" 


This attribute is used to 
indicate what method was 
used to generate the data 
file. The valid options for 
this attribute are VRW which 
indicates the file was 
generated by the Volunteer 
Recording Workstation, or 
AD which indicates that the 
file was generated through 
an Analog-to-Digital 
conversion. 


Element: Data 


This element is used to identify a block of records that 
are all associated with a single audio file. Each data 
file contains at least one Data element, and each Data 
Element contains at least one Record element. 


Attribute Name 


Attribute Data Type 


Description 


AudioFile 


String 


This attribute holds the 
name of the audio file that 
the records are associated 
with. 


TotalTime 


String 


This attribute holds a string 
that indicates, in 
milliseconds, the amount of 
recorded time in the audio 
file. 
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Element: Record 


This element is used to identify elements of audio that 
are associated with a given navigation point. The 
attributes identify which navigation point, and what 
subset of the audio is associated with that element. 


Attribute Name 


Attribute Data T3^e 


Description 


ID 


The 'ID' data type is a subset 
of strings. An ID must begin 
with a letter, an underscore 
(_) or a colon (:). This can be 
followed with one or more 
letters, digits, periods (.), 
hyphens (-), underscores (_), 
colons (:)," Combining 
Characters", or "Extender 
Characters." (For a complete 
definition of the allowed 
values for an ID, please refer 
to 

http://www.w3.Org/TR/2000/R 

EC-xml-20001006#sec- 

attribute-types) 


This attribute holds 
the value of the ID 
attribute of the 
associated navigation 
point in the source text 
file. 






Starttime 


String 


This attribute holds a 
string that indicates 
the time, in 
milliseconds, where 
the associated audio 
segment begins. 


EndTime 


String 


This attribute holds a 
string that indicates 
the time, in 
milliseconds, where 
the associated audio 
segment ends. 


Type 


"e" or "o" 


This attribute contains 
a single character, 
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either e or o, which 
indicates what type of 
audio event this is. A 
type of e indicates an 
Encapsulated 
segment; one in which 
the desired audio stops 
at exactly the end 
point described. A 
type of 0 indicates and 
Open-Ended segment; 
one in which the audio 
should continue to 
play until the start 
time of the next event 
in the list, or to the 
end of the file. 


Comment 


String 


This attribute holds a 
string that can be used 
to add any additional 
comments relating to 
this audio segment. 



Note: The values of the TotalTime, StartTime, and EndTime are actual 
numerical values that would be expressed as a data type of Double Byte 
Integer. They are represented as strings in the Time Stamp and Track 
Announcement Data Files because the definition for XML only allows three 
t3rpes of data; strings, enumerated data (such as found in the Type attribute 
here), or a "Tokenized Type" which defines several subsets of the string type 
that must conform to specific rules. 
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